Robust Text Segmentation in Low Quality Images via Adaptive Stroke Width Estimation and Stroke Based Superpixel Grouping

نویسندگان

  • Anna Zhu
  • Guoyou Wang
  • Yangbo Dong
چکیده

Text segmentation is an important step in the process of character recognition. In literature, there have been numerous methods that work very well in practical applications. However, when an image includes strong noise or surface reflection distraction, accurate text segmentation still faces many challenges. Observing that the stroke width of text is stable and significantly different from that of reflective regions generally, we present a novel method for text segmentation using adaptive stroke width estimation and simple linear iterative clustering superpixel (SLIC-superpixel) region growing in this paper. It consists of four following steps: The first is to normalize image intensity to overcome the influence of gray changes. The second utilizes the intensity consistency to compute normalized stroke width (NSW) map. The third is to estimate the optimal stroke width through searching for the peak value of the histogram of normalized stroke width, the text polarity is also determined. Finally, we propose a local region growing method for text extraction using SLIC-superpixel. Unlike current existing methods of computing stroke width, such as gray level jump on a horizontal scan line and gradient-based SWT methods, the proposed method is based on the statistics of stroke width in the whole image. Hence the stroke width estimation is not only invariant in scale and rotation, but also more robust to surface reflection and noise than that of those methods based only on the pairs of sudden changes of intensity or gradient maps. Experiments with many real images, such as laser marking detonator codes, notice signatures and vehicle license plates, etc., have shown that the proposed algorithm can work well in noised images and also achieve comparable performance with current state-of-the-art method on text segmentation from low quality images.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Directional Stroke Width Transform to Separate Text and Graphics in City Maps

One of the complex documents in the real world is city maps. In these kinds of maps, text labels overlap by graphics with having a variety of fonts and styles in different orientations. Usually, text and graphic colour is not predefined due to various map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...

متن کامل

Stroke Width-Based Contrast Feature for Document Image Binarization

Automatic segmentation of foreground text from the background in degraded document images is very much essential for the smooth reading of the document content and recognition tasks by machine. In this paper, we present a novel approach to the binarization of degraded document images. The proposed method uses a new local contrast feature extracted based on the stroke width of text. First, a pre...

متن کامل

Robust Potato Color Image Segmentation using Adaptive Fuzzy Inference System

Potato image segmentation is an important part of image-based potato defect detection. This paper presents a robust potato color image segmentation through a combination of a fuzzy rule based system, an image thresholding based on Genetic Algorithm (GA) optimization and morphological operators. The proposed potato color image segmentation is robust against variation of background, distance and ...

متن کامل

A hierarchical Convolutional Neural Network for Segmentation of Stroke Lesion in 3D Brain MRI

Introduction: Brain tumors such as glioma are among the most aggressive lesions, which result in a very short life expectancy in patients. Image segmentation is highly essential in medical image analysis with applications, particularly in clinical practices to treat brain tumors. Accurate segmentation of magnetic resonance data is crucial for diagnostic purposes, planning surgical treatments, a...

متن کامل

A Novel Stroke Width Based Binarization Method to Handle Closely Spaced Thick Characters

Signboards and billboards provide a challenge to image seg¬mentation methods, since these images may also have pictures and graphical objects, apart from text objects. Methods that often succeed in more traditional text block segmentation situations do not perform well here since estimation of text lines and character widths etc fail due to the short sample sizes. Further, extraction of charact...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014